A Randomized Algorithm for Principal Component Analysis

نویسندگان

  • Vladimir Rokhlin
  • Arthur Szlam
  • Mark Tygert
چکیده

Principal component analysis (PCA) requires the computation of a low-rank approximation to a matrix containing the data being analyzed. In many applications of PCA, the best possible accuracy of any rank-deficient approximation is at most a few digits (measured in the spectral norm, relative to the spectral norm of the matrix being approximated). In such circumstances, existing efficient algorithms do not guarantee good accuracy for the approximations they produce, unless one or both dimensions of the matrix being approximated are small. We describe an efficient algorithm for the low-rank approximation of matrices that produces accuracy very close to the best possible, for matrices of arbitrary sizes. We illustrate our theoretical results via several numerical examples.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sparse Structured Principal Component Analysis and Model Learning for Classification and Quality Detection of Rice Grains

In scientific and commercial fields associated with modern agriculture, the categorization of different rice types and determination of its quality is very important. Various image processing algorithms are applied in recent years to detect different agricultural products. The problem of rice classification and quality detection in this paper is presented based on model learning concepts includ...

متن کامل

Learning Bayesian Network Structure Using Genetic Algorithm with Consideration of the Node Ordering via Principal Component Analysis

‎The most challenging task in dealing with Bayesian networks is learning their structure‎. ‎Two classical approaches are often used for learning Bayesian network structure;‎ ‎Constraint-Based method and Score-and-Search-Based one‎. ‎But neither the first nor the second one are completely satisfactory‎. ‎Therefore the heuristic search such as Genetic Alg...

متن کامل

Modelling of some soil physical quality indicators using hybrid algorithm principal component analysis - artificial neural network

One of the important issues in the analysis of soils is to evaluate their features. In estimation of the hardly available properties, it seems the using of Data mining is appropriate. Therefore, the modelling of some soil quality indicators, using some of the early features of soil which have been proved by some researchers, have been considered. For this purpose, 140 disturbed and 140 undistur...

متن کامل

Principal Component Analysis for Soil Conservation Tillage vs Conventional Tillage in Semi Arid Region of Punjab Province of Pakistan

Principal component analysis is a valid method used for data compression and information extraction in a given set of experiments. It is a well-known classical data analysis technique. There are a number of algorithms for solving the problems, some scaling better than others. Wheat ranks as the staple food of most of the nations as well as an agent of poverty reduction, food security and world ...

متن کامل

Implementation of Face Recognition Algorithm on Fields Programmable Gate Array Card

The evolution of today's application technologies requires a certain level of robustness, reliability and ease of integration. We choose the Fields Programmable Gate Array (FPGA) hardware description language to implement the facial recognition algorithm based on "Eigen faces" using Principal Component Analysis. In this paper, we first present an overview of the PCA used for facial recognition,...

متن کامل

Dimensionality Reduction Using a Randomized Projection Algorithm: Preliminary Results

We describe an implementation and experiments with a low-distortion randomized projection algorithm [LINI94] that can reduce the number of dimensions in the data by a considerable amount. The performance of the randomized algorithm is compared with that of a popular technique---Principal Component Analysis (PCA). The experiments show that the randomized projection algorithm consistently outperf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • SIAM J. Matrix Analysis Applications

دوره 31  شماره 

صفحات  -

تاریخ انتشار 2009